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Abstract. We demonstrate existence of a tile assembly system that self-assembles the statistically self- 
similar Sicrpinski Triangle in the Winfree-Rothemund Tile Assembly Model. This appears to be the 
first paper that considers self-assembly of a random fractal, instead of a deterministic fractal or a finite, 
bounded shape. Our technical contributions include a way to remember, and use, unboundedly-long 
prefixes of an infinite coding sequence at each stage of fractal construction; a tile assembly mechanism 
for nested recursion; and a definition of "almost-everywhere local determinism," to describe a tileset 
whose assembly is locally determined, conditional upon a zcta-dimcnsion zero set of (infinitely many) 
"input" tiles. This Icist is similar to the definition of randomized computation for Turing machines, in 
which an algorithm is deterministic relative to an oracle sequence of coin flips that provides advice but 
does not itself compute. Keywords: tile self-assembly, statistically self-similar Sierpinski Triangle. 

1 Introduction 

It is now more than a decade since Winfree [15] first noted that DNA nanostructures created by 
Seeman [12] could be mathematically abstracted as tiles on the integer plane. Since then, theorists 
and practitioners have pursued several research directions in algorithmic nanotile self-assembly, 
including: the minimum number of distinct tile types needed to build finite structures [11], tech- 
niques to minimize molecular binding errors [2] [13], reduction of complexity by using randomness 
and building the target shape with "only" high probability [1] [5], and exploring the absolute the- 
oretical limits of the Winfree-Rothemund tile assembly model [7] [9]. While varied in focus, this 
research shares a common characteristic: the purpose of each tileset is to build a specific shape, 
predetermined ah initio. Macroscale computers, by contrast, receive information, store that infor- 
mation, and take steps adaptively, based upon the information received. The main contribution of 
this paper is to demonstrate existence of a tileset that (in theory) does the same thing at the nano 
scale. 

"Tileset" in this context means a finite set of unique types of four-sided tiles. We assume there 
are infinitely many tiles of each tile type, and these tiles combine "in solution" in a nondeterministic, 
asynchronous manner. Tiles with complementary sides can bind to one another and form stable 
configurations through a random process of self-assembly. 

We show there is a finite tileset that self-assembles the statistically self-similar Sierpinski Trian- 
gle in the standard Winfree-Rothemund tile assembly model. The statistically self-similar Sierpinski 
triangle (sssST) is an infinite fractal structure that generalizes the well-known discrete Sierpinski 
Triangle. The sssST is built in stages, with a random "three-sided" coin flipped at each stage. The 
construction of the next stage depends on the entire history of ternary coin flips. We model this in 
tiles by starting each new stage with a "decision point" that permits the binding of one of three 
different tile types, each with 1/3 binding probability. The identity of the tile type that binds is then 
propagated through the structure so the history of previous stages is available to all future stages. 
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In this way, a bounded number of tile types can produce a structure with unbounded memory — by 
using the increasing size of the tile configuration to transmit progressively more information. 

Winfree was the first to construct a tileset that self-assembled into a fractal structure, when 
he showed that seven tile types computing the XOR operation would build the discrete Sierpinski 
Triangle [15]. He implemented that tileset in wetlab experiments [16]. In recent theoretical work, 
both Patitz and Summers [10] and Kautz and Lathrop [6] have presented methods to construct 
tilesets for broad classes of self-similar fractals. The current paper appears to be the first work to 
consider self-assembly of a fractal that is statistically self-similar, not deterministically self-similar. 
In fact, we need to extend the current definitions in the literature in order to formalize the main 
result of this paper, which is, "There is a tile assembly system that computes the statistically 
self-similar Sierpinski Triangle." We make that statement rigorous in Section 3.3. 

Becker et al. [1] were the first to define a tile assembly model in which different tile types were 
assigned different concentrations "in solution," so a particular location could bind to any of several 
tile types, each tile type with its own binding probability. Kao and Schweller subsequently refined 
this concentration model [5] , and Doty has recently extended their work [3] . Each of those papers 
focuses on randomized algorithms to build an n x n square for finite n. 

Our overall motivation for the current paper is to build a theory of randomized computation for 
self-assembling networks considered as evolving, distributed systems. We believe randomization will 
be an important tool for solving problems of fault tolerance in self-assembly, much as randomized 
algorithms have been able to overcome impossibility barriers faced by deterministic distributed 
systems that admit faults. 

The rest of this paper is structured as follows. In Section 2, we provide background in tile self- 
assembly, and the statistically self-similar Sierpinski Triangle. In Section 3, we present the fractal 
construction. Section 4 concludes the paper, and suggests a direction for future research. 

2 Background 

2.1 Tile assembly background 

Tiles have four sides (often referred to as north, south, east and west) and exactly one orientation, 
i.e., they cannot be rotated. A tile assembly system T is a 5-tuple (T, a, U, r, R), where T is a finite 
set of tile types; a is the seed tile or seed assembly, the "starting configuration" for assemblies of 
T; T : T X {N, S, E, W} ^ 17 x {0, 1, 2} is an assignment of symbols ("glue names") and a "glue 
strength" (0, 1, or 2) to the north, south, east and west sides of each tile; and a symmetric relation 
R C. E X E that specifies which glues can bind with nonzero strength. In this model, there are no 
negative glue strengths, i.e., two tiles cannot repel each other. 

A configuration of T is a set of tiles, all of which are tile types from T, that have been placed 
in the plane, and the configuration is stable if the binding strength (from r and R in T) at every 
possible cut is at least 2. An assembly sequence is a sequence of single-tile additions to the frontier 
of the assembly constructed at the previous stage. Assembly sequences can be finite or infinite in 
length. The result of assembly sequence a is the union of the tile configurations obtained at every 
finite stage of "a . The assemblies produced by T is the set of all stable assemblies that can be built 
by starting from the seed assembly of T and legally adding tiles. An assembly of T is terminal if 
no tiles can be stably added to it. 

Intuitively, tiles bind stably exactly when they share complementary sides with the same labels 
and glue strengths, as long as the total binding strength adds up to at least 2. Figure 1 shows how 
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a tile is represented graphically, and Figure 2 provides an example of four tiles binding to form a 
stable configuration. 



The north side has glue type 
"IS&C-H" and binding strength 2, 
represented by a double line. 



The west side has 
binding strength 0,- 
represented by a 
dashed line. 



1S&C-I1 
1S&C 



The east side has glue 
_type "l#-ir'' and binding 
strength 1 , represented 
by a single line. 



The south side has 
glue type "IS&C" and 
binding strength 1 . 



Fig. 1. An example tile with explanation. 



2sa.c-ii 

1 2C-V. 



(ii) 



2S&C-I1 
II 2C-I1 



2S&C-I1 
11 2C-11 



(iii) 



(iv) 



Fig. 2. An example of how tiles bind in a system of temperature 2. In (i), a seed tile (at left) is approached by a tile 
with a strength 2 bond on its west side, and that glue matches the strength two glue of the east side of the seed tile. 
In (ii), a tile approaches from the north, and binds with strength 2 on a single side. In (iii), we see an example of 
cooperation by two different strength one glues, in order to provide a strength 2 bond to the approaching tile. The 
final, stable, tile configuration appears in (iv). In this example, three tiles are colored blue, and one is colored white. 
It is the differing colors of the tiles that give rise to a self-assembled "shape." 



2.2 Statistically self-similiar Sierpinski Triangle 

To build the discrete Sierpinski Triangle in the first quadrant of the plane, place a single colored tile 
at the origin (stage 0), then colored tiles immediately north and east and a colorless tile northeast 
(stage 1). More generally, at stage n + 1, copy the tile configuration obtained at stage n to the north 
and east, and place colorless tiles to the east and north of those copies, to fill in a square of 2" x 2" 
tiles. Construction of the discrete sssST generalizes this process. Instead of always selecting the 
northeast area as the area to mask out, at each stage n > 0, select one area (north, northeast, or 
east) uniformly at random, mask out the area selected, and copy the configuration achieved at stage 
n — 1 to the unselected areas. (A more formal definition of the sssST appears in the Appendix.) 
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In what follows, we will refer to the north, northeast and east areas as Area 1, 2, and 3, 
respectively. Figure 3 illustrates the first three stages of construction of the sssST for a particular 
coding sequence. We now define formally what it means for a tile assembly system to self-assemble 
the sssST. Intuitively, we want all possible configurations of the sssST to be achievable, only sssST 
configurations to be achievable, and each configuration to be achievable with equal likelihood. 
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Fig. 3. The start and first tiiree stages of construction of (one possible configuration of) tiie statistically self-similar 
Sierpinski Triangle. Each stage of construction depends on the coding sequence prefix of the previous stage. The 
"traditional" Sierpinski Triangle has coding sequence (22222 • ■ • ) 



Definition 1. Let T be a tile assembly system. H^e sa?/ T self-assembles the statistically self-similar 
Sierpinski Triangle if the following conditions hold: 

1. For every infinite ternary coding sequence S G {1,2,3}°°, there is an assembly sequence 'a of 
T whose terminal assembly is the configuration of the sssST with coding sequence S. 

2. If A is a terminal assembly of T , then there is a sequence S £ {1,2,3}°° such that A is the 
configuration of the sssST with coding sequence S. 

3. For any finite coding sequence prefix a G {1,2,3}"' of length n, the probability that a terminal 
assembly of T will be a configuration of the sssST whose coding sequence extends a, is 3~". 



3 Fractal self-assembly 

3.1 The first four stages of construction 

We will build the sssST from a single seed tile, using a tile assembly system with temperature 2. It 
will be important for the proof of correctness that each tile binds with exactly strength 2, as that 
is a necessary condition for local determinism (a condition defined in Section 3.3 that guarantees a 
tile assembly system has a unique terminal assembly). There are finitely many distinct tile types, 
whereas the length of the coding sequence of the structure we wish to build increases without bound. 
So it is impossible to send the entire coding sequence to one location — but it is also unnecessary. 
To know which color tile should bind at a location, it suffices to send one bit, which communicates 
whether the location is an element of the sssST. 
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Lemma 1. There is a tileset that self- assembles the first two stages of the sssST such that (1) 
each tile hinds with strength exactly 2; (2) the first two decisions of the coding sequence can be 
transmitted to each location in advance of a tile being placed there, so the properly colored tile is 
placed each time; and (3) once assembled, the configuration transmits north the sequence "decision 
1, decision 2, decision 1, " and transmits the same sequence east. 

Proof. "By inspection" of Figure 4. (More formally, the Appendix exhibits a set of 84 tiles that 
implements the template in Figure 4, and a specific example of how they combine to realize one 
coding sequence.) Each location in the 4x4 square is colored ON or OFF, determined completely 
by the values of decisions 1 and 2, and the offset of the location from the seed tile in the southwest 
corner of the square. Figure 4(i) shows an order of tile placement, such that each tile in the square 
binds to the pre-existing configuration with strength 2. Figure 4(ii) exhibits a flow of information 
from the two decision points to each location, in a way that is consistent with the order of placement. 
Therefore, it's possible to construct a tileset that correctly colors each of the 4x4 squares, for any 
values of decisions 1 and 2. 



Order of placement 



Information transfer 



^ \ 



n 1 Decision 2 



eci; Decision 2 



Decision 1 Decisioi 



Fig. 4. Two views of the construction of the first two stages of the sssST. On the left is the order the tiles are placed 
in the plane, as they bind to the configuration. The initial seed tile is located at the lower left. On the right is the 
way information is transferred to each tile (to determine the color of the tile at that location) and outward from the 
configuration, so the coding sequence is available to later stages. 



We now extend the 4x4 construction to a 16 x 16 construction, i.e., the first three stages of the 
fractal. 

Lemma 2. There is a tileset that self- assembles the first four stages of the sssST (i.e., a square 
of 16 X 16 tiles, starting at the seed) such that (1) each tile binds with strength exactly 2; (2) the 
properly colored tile is placed each time; and (3) let A, B, C and D represent the values of coding 
decisions 1, 2, 3 and 4, respectively; then, once assembled, the configuration transmits north the 
sequence {ABAC AB AD ABAC ABA) (reading from west to east), and transmits the same sequence 
east (reading from south to north). 

Proof. We can design a tileset that satisfies (1) and (2), using techniques similar to the tileset used 
in the formal proof of Lemma 1. To see that (3) is true, consider Figure 5(i). The transmission of 
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decision information is consistent with the statement of the Lemma, and, as before, we can verify 
by inspection that the graph of information transfer is consistent with the actual placement of tiles 
in the plane, as shown in Figure 5(ii). 
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Fig. 5. Two views of tlie first four stages of construction of tlie sssST (and tlie beginning of the fiftii). The decision 
tiles are red, the crawlback tiles are magenta (light purple), and the popup tiles are purple. In (i), the red arrows 
show the flow of information from decision points to the frontier sequence F4. The first decision- value is labeled "A," 
the second "B," and so forth. In (ii), the black arrows indicate the order of tile placement as the recursion progresses. 



Even though most of the recursion flows from west to east, and from south to north, when a new 
decision tile (colored red in Figure 5) binds, it contains the information of which area should be 
masked, which needs to be conveyed to all three areas. Regardless of where the decision location is 
placed, in order to get to the "bottom left" of some area, to communicate the new information, it is 
necessary to place tiles "out of order," without having full knowledge of the coding sequence prefix. 
(This is a bandwidth problem: any location can receive only a constant-size amount of information, 
and the coding sequence grows without bound.) Tiles placed in that way are crawlback tiles, and 
bind from west to east, or north to south, in our construction. 

To ensure binding of crawlback tiles of the correct color (i.e., colored correctly in or out of the 
sssST configuration we are trying to build), we pass an additional bit northward along each column, 
and eastward along each row, that communicates whether the crawlback tile in that column/row 
is ON or OFF. 

Figure 5(ii) shows how, once enough tiles bind to create a square of dimension 2" x 2^ for 
n G {2,3,4}, a popup tile (colored blue) binds at the northeast corner of the square (due north of 
the crawlback tiles) with strength 2 on a single side, to start the process of assembling the next 
stage of the fractal. The popup tile also computes and transmits the initial value of the masked 
flag, as we will see in the next subsection. 

3.2 The main recursion 

We now show that a tileset exists that executes Algorithm 1, when assembling at temperature 2. 
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Algorithm 1 Main algorithm to self-assemble the sssST 

Require: Current location is {x, y), and is colored ON iff (a;, y) is in all versions of the sssST that extend the coding 
sequence prefix decided so far. 
1: if (0 < X < 4) or (0 < y < 4) then 

2: use the 4x4basecase method to determine which tile to place at (x, y) 

3: else if crawlbackeastwest or crawlbacknorthsouth then {either the east or north neighbor reports {x, y) should 

contain a crawlback tile} 
4: use the crawlback method to determine which tile to place at (a;, y) 
5: else if masked then 
6: color (x, y) OFF 
7: else 

8: color [x, y) ON or OFF, depending on the one or two coding sequence trits received from neighbors, and the 

relative location of (a;, y) in the 4x4 square currently being assembled 
9: end if 



Lemma 3. Line 1 of Algorithm 1 can he executed by a tile assembly system. Further, there is a 
well-defined basecase method that will correctly lay out tiles for any < a; < 4, or < y < 4. (So 
a tile assembly system can execute Line 2 as well.) 

Proof. Wc can send a Boolean flag north in each column, and the flag is only TRUE in the column 
X = 4; similarly for the row j/ = 4. By slightly altering the tileset that performs the layout in 
Figure 4, we obtain the ability to tile the westmost four columns (and, by rotation and reflection 
of glue types, the southmost four rows) of the sssST. The Appendix discusses this in more detail. 

Lemma 4. Lines 3 and 4 of Algorithm 1 can be executed by a tile assembly system. 

Proof. As already discussed, by appending the value of one bit to eastern or northern glue names, 
as appropriate, it is possible to place a crawlback tile of the appropriate color at location (x, y). We 
only need finitely many distinct crawlback tile types, as each one is responsible for transmitting 
only one coding sequence trit, and, if the tiles count modulo 4 as they are placed, they can transfer 
relative location information needed in line 8 of Algorithm 1. Further implementation details appear 
in the Appendix. 

Lemma 5. It is possible to execute Line 5 of Algorithm 1 (i.e., to compute the masked flag) in 
tile assembly. 

Proof. Consider the left side of Figure 6. It shows how the delimeters interact with the "area-border 
columns" (i.e., columns whose x coordinate is 2^ for some G N), and their corresponding "decision 
columns" (where x = 2^^ -|- 1). Each decision column toggles between two states: a POP state, and 
a WAIT-TO-POP state. If the delimctcr flag gets transferred east, through the eastmost column, 
to a location (a, h) receiving a WAIT-TO-POP flag from the south, then construction for that area 
is completed, and we place a tile at that location with strength one on all four sides. However, if 
(a, h) is receiving POP from the south, then we place a tile at (a, h) with a strength 2 glue on the 
north side, so a popup tile can attach to it, and start construction of the areas to the north and 
northeast. We compute the value of masked with constant-size messages as follows. A decision tile 
determines which of the new areas to build will be masked completely, and transfer that to the 
rest of its column, including the popup tile. Prom the popup tile, the correct value of masked is 
sent west and east, to build Areas 1 and 2. If we are building an unmasked area, we set masked 
to TRUE if, when laying the crawlback tiles, we see a coding trit in a former decision column that 
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33 western tiles of the norttiern border of Stage 7 (64 x 64 square) 




The first 33 x 33 tiies of the setf-assembly (seed tiie at southwest corner). 
Crawibacl< tiies are magenta, popup tiies are biue, and popright tiies are green. 
The red tiies are the (randomiy chosen) decision tiies, and the red arrows show 
transmission of the "decision fiag" from coiumn to row. 



Fig. 6. Illustration of the interaction between delimeters, area-border columns (columns of form x — 2*°), and decision 
columns (columns of form x — 2^ + 1). The left diagram shows how a delimeter starts at a popup tile and climbs 
northeast until it encounters an area-border column, then connects with the decision column due east. If the decision 
column is in state POP (shown with a "p"), we place a popup tile due north of the connection point, and toggle 
the column to state WAIT-TO-POP (shown with a "w"). If the column is in state WAIT-TO-POP, then we know 
that we have built the current area to its maximum height, and the next row should be one of crawlback tiles. The 
right diagram shows the skeleton of the nested recursion that builds each area at each stage. In addition to the 
existing column and delimeter mechanisms, we also use a row-and-popright-tile mechanism, obtained by rotating the 
original mechanisms 90° to the right, and reflecting them from bottom to top. The diagram shows how the popup 
and popright and tiles interact, with delimeters invisible for clarity. Note the one asymmetry of the recursion is the 
red "decision flag" transmitted from column to row, so a decision tile binds to the configuration at the start of each 
new stage. 
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implies that area should be masked off. Once masked is set to TRUE, it remains true for all other 
contiguous crawlback tiles. Because of how the coding sequence prefix is laid out, as we lay the 
crawlback tiles we can compute the correct value of masked, and build until the delimeter and 
popup (or popright) mechanisms inform us we have come to the end of the masked area. 

Lemma 6. Line 8 of Algorithm 1 can he computed by a tile assembly system. 

Proof. The right side of Figure 6 exhibits a nested recursion by popup and popright tiles that 
delimit progressively smaller areas, down to 4 x 4 squares. As the construction of 4 x 4 squares 
is hardcoded into the construction, and the coding sequence prefix is transferred so masked can 
be computed correctly, and the first two decision values are available, from either the west or the 
south, when deciding how to color an unmasked 4x4 square, we can color each square correctly. 
Finally, as the construction builds one 2^ x 2^ stage at a time, we will tile every location in the 
first quadrant. 

Theorem 1. There exists a tile assembly system that self-assembles the statistically self-similar 
Sierpinski Triangle. 

3.3 The tileset computes 

To show that a tileset computes the sssST, we need to prove that the tileset is actually "doing the 
work," and not receiving "too much help" from the randomly chosen decision tiles {e.g., the set of 
decision tiles do not themselves fill the plane) . We achieve this by applying zeta dimension. 

Definition 2. Let S C Z x Z be a set of locations in the integer plane. Define the S-zeta-function 

(s '■ [0,oo) [0,oo) by Csis) = X^(o o)^(a; 2/)eS (^V + 2/^) define the zeta dimension of S, 

written dim^(5), as follows: dim^(S') = inf{s | Cs{s) < oo}. 

Zeta dimension is a natural measure of the "size" of a set of discrete lattice points, and has been 
rediscovered (and renamed) by many researchers, since the nineteenth century. Discussion of this 
history, and a survey of some classical and computer science applications, can be found in [4]. Our 
interest in the definition is due to the fact that sets with zeta dimension zero are "negligible." 

An assembly sequence "a is locally deterministic [14] if (1) each tile added in "a binds with the 
minimum strength required for binding; (2) if there is a tile of type to at location I in the result 
of a, and tg and the immediate "OUT-neighbors" of to arc deleted from the result of "a , then no 
other tile type in T can legally bind at I; the result of ~a is terminal. 

Definition 3. Let T be a tile assembly system with temperature 2. We say T is almost-everywhere 
locally deterministic if T is locally deterministic except for a set of locations with zeta dimension 
zero. 

Informally, we can say that if T is almost-everywhere locally deterministic, then it computes deter- 
ministically, except for an "advice tileset" of negligible size. This is analogous to one definition of 
randomized computation, in which an oracle Turing machine computes deterministically, but has 
access to an infinite binary oracle string, for the purpose of generating coin flips. Since the set of 
decision locations has zeta dimension zero, we are finally able to state our main result, which is the 
following corollary of Theorem 1. 

Corollary 1. There is an almost-everywhere locally deterministic tile assembly system that self- 
assembles the statistically self-similar Sierpinski Triangle. 
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4 Conclusion 



We have demonstrated the existence of a tile assembly system that self-assembles a random fractal, 
the statistically self-similar Sierpinski Triangle. Our algorithm made use of novel mechanisms for 
information transfer and locally computed nested recursion. However, the tile assembly model used 
throughout this paper is error-free, and the algorithm presented is fragile with respect to errors. 
Any break in the paths of information-flow causes the entire construction to repeat incorrectly 
forever, or to halt. One direction for future investigation would be to construct a tile assembly 
system that builds a random fractal in a way that is robust against binding errors. 

Acknowledgements 

I am grateful to James Lathrop, Jack Lutz, Matthew Patitz and Scott Summers for helpful dis- 
cussions. Jack Lutz initially suggested this problem to me. During the preparation of this paper, I 
used the tile assembly simulation programs TileSimulator3.2.2.1 by James Lathrop, and ISU TAS 
by Matthew Patitz [8]. 

References 

[1] Becker, F., Rapaport, L, and Remila, E. Self-assemblying classes of shapes with a minimum number of 
tiles, and in optimal time. In FSTTCS (2006), S. Arun-Kumar and N. Garg, Eds., vol. 4337 of Lecture Notes 
in Computer Science, Springer, pp. 45-56. 

[2] Chen, H.-L., and Goel, A. Error free self-assembly using error prone tiles. In DNA (2004), C. Ferretti, 

G. Mauri, and C. Zaudrou, Eds., vol. 3384 of Lecture Notes in Computer Science, Springer, pp. 62 75. 
[3] Doty, D. Randomized sclf-asscmbly for exact shapes. Tech. Rep. 0901.1849, Computing Research Repository, 
2009. 

[4] Doty, D., Gu, X., Lutz, J. H., Mayordomo, E., and Moser, P. Zeta-dimension. In MFCS (2005), 
J. Jedrzejowicz and A. Szepietowski, Eds., vol. 3618 of Lecture Notes in Computer Science, Springer, pp. 283- 
294. 

[5] Kao, M.-Y., and Schweller, R. T. Randomized self-assembly for approximate shapes. In ICALP (1) (2008), 
L. Aceto, I. Damgard, L. A. Goldberg, M. M. Halldorsson, A. Ingolfsdottir, and I. Walukiewicz, Eds., vol. 5125 
of Lecture Notes in Computer Science, Springer, pp. 370-384. 

[6] Kautz, S. M., and Lathrop, J. I. Self-assembly of the discrete Sierpinski carpet and related fractals. In 

FNANO (2009). 

[7] Lathrop, J. I., Lutz, J. H., Patitz, M. J., and Summers, S. M. Computability and complexity in self- 
assembly. In CiE (2008), A. Beckmann, C. Dimitracopoulos, and B. Lowe, Eds., vol. 5028 of Lecture Notes in 
Computer Science, Springer, pp. 349-358. 

[8] Patitz, M. J. Simulation of self-assembly in the abstract tile assembly model with ISU TAS. In FNANO 
(2009). 

[9] Patitz, M. J., and Summers, S. M. Self-assembly of decidable sets. In UC (2008), C. S. Calude, J. F. Costa, 
R. Frcund, M. Oswald, and G. Rozenberg, Eds., vol. 5204 of Lecture Notes in Computer Science, Springer, 
pp. 206-219. 

[10] Patitz, M. J., and Summers, S. M. Self-assembly of discrete self-similar fractals. In DNA (2008). 

[11] Rothemund, p. W. K., and Winfree, E. The program-size complexity of self-assembled squares (extended 

abstract). In STOC (2000), pp. 459-468. 
[12] Seeman, N. C. De novo design of sequences for nucleic acid structural engineering. Journal of hiomolecular 

structure and dynamics 8, 3 (1990), 573-581. 
[13] SOLOVEICHIK, D., Cook, M., and Winfree, E. Combining self-healing and proofreading in self-assembly. 

Natural Computing 7, 2 (2008), 203-218. 
[14] SOLOVEICHIK, D., and Winfree, E. Complexity of self-assembled shapes. SIAM J. Comput. 36, 6 (2007), 

1544-1569. 

[15] Winfree, E. Algorithmic Self-Assembly of DNA. PhD thesis, California Institute of Technology, 1998. 
[16] Winfree, E. Algorithmic self-assembly of DNA; Theoretical motivations and 2d assembly experiments. Journal 
of Biomolecular Structure and Dynamics 11, S2 (2000), 263-270. 



10 



A Proof of Lemma 1 



We restate the lemma for clarity. 

Lemma 7. There is a tileset that self-assembles the first two stages of the sssST such that (1) 
each tile hinds with strength exactly 2; (2) the first two decisions of the coding sequence can he 
transmitted to each location in advance of a tile being placed there, so the properly colored tile is 
placed each time; and ( 3) once assembled, the configuration transmits north the sequence "decision 
1, decision 2, decision 1, " and transmits the same sequence east. 

Proof. Figures 7, 8 and 9 exhibit a tileset of 84 tiles that self-assembles the first two stages (4x4 
square) of the sssST, consistent with the order of placement and the information transfer graph in 
Figure 4. We can see this explicitly in Figure 10, which shows the order of placement, starting with 
the seed tile, of a 4 x 4 square with coding sequence prefix (12). 

B Crawlback tiles and the basecase method 

Recall that the crawlback tiles are placed "out of order" to build the southern row of a new Area 
1, or the western column of a new Area 2 or Area 3. We don't have access to the first decisions 
in the coding sequence prefix before we have to decide how to color the tile that binds to the 
crawlback location. Therefore, we make use of the fact that, when drawing the new base row of 
Area 1, or the base column of Areas 2 or 3, it is identical to the baserow (or leftmost column) of the 
configuration achieved so far. The purpose of these additional bits is to copy the original baserow 
(leftmost column) identically into the new area. This information can be transferred with constant 
size (1-bit) messages. We implemented this in tiles by appending "*on" or "*off" to the names of 
northern and eastern glues. Figure 11 illustrates the crawlback mechanism in more detail. 

There are two basecase submethods: one to construct 4x4 squares along the western edge 
of the fractal, and the other to construct 4x4 squares along the southern edge. The tilcscts that 
perform these submethods are identical, except that one is a rotation and reflection of the other. 
Figure 12 illustrates operation of the western edge basecase method given coding sequence prefix 
(12). Note that the tile types, and the order of tile placement, are very similar to the tileset used 
for the first two stages of construction {i.e., to prove Lemma 1). 

Technically, the basecase methods for Area 2 and Area 3 are not identical. The tile types that 
construct the baserow of Area 2 differ slightly from the ones that constructs the baserow of Area 
3, because of Area 2's transmission of the decision flag. We cover this in the next section. 

C Delimeters, popup tiles and popright tiles 

It is essential to know when to stop building a particular area — for example, when an area to mask 
is nested inside a larger area that is not completely masked out. We achieve this control by allowing 
a decision column to grow a popup tile only every other time that a delimeter encounters it as the 
assembly sequence progresses. When a decision tile binds at a decision point, it sends two pieces of 
information northward: the decision-value of the column, and the state that it should pop up when 
a delimeter encounters it. Once the popup tile binds, the column sends the decision- value north as 
before, but also a WAIT-TO-POP state, that is toggled back to POP by the next delimeter (and 
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Fig. 7. Combined witii Figures 8 and 9 tliis Figure exliibits a set of 84 tile types tliat implements self-assembly of 
the first two stages of the sssST. 
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Fig. 8. Combined witii Figures 7 and 9 tliis Figure exliibits a set of 84 tile types tliat implements self-assembly of 
the first two stages of the sssST. 
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Fig. 9. Combined witii Figures 7 and 8 tliis Figure exliibits a set of 84 tile types tliat implements self-assembly of 
the first two stages of the sssST. 
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Fig. 10. An implementation of the self-assembly of coding sequence prefix (12), using the tileset presented in Fig 
ures 7, 8 and 9. The red lines show the order of tile placement on the plane. 



the crawlback tile that binds to the decision column, just north of the location where the delimeter 
reaches the decision column). 

There are two rules placed on delimeters to prevent them from propagating for too long: a 
delimeter stops propagating once it sets a decision column back to the state POP; and a delimeter 
stops propagating if it encounters crawlback tiles. These rules keep a delimeter from extending 
beyond the Area it is intended to delimit. 

The delimeters as we have described them will correctly reproduce nested recursions for nested 
Area I's, and also permit the binding of a popup tile to a new decision column, to continue 
the next stage of self-assembly. To obtain nested recursions for Areas 2 and 3, we "rotate" these 
delimeter rules ninety degrees to the right, and reflect them across the x-direction. Instead of 
decision columns, we consider decision rows that bind "popright" tiles to the east when the proper 
delimeter encounters the decision row. (Such a "rotation" is technically the definition of a new 
group of tile types whose glue types are a rotated and reflected version of the appropriate tile types 
used in Area 1.) As before, these delimeters cannot cross crawlback tiles. In addition, a delimeter 
cannot cross from Area 3 to Area 2. That can be detected locally and with a constant-size message, 
by encountering the row encoded to produce a new decision point. 

Figure 13 provides an example of implementation of an Area 1 delimeter. 

While the tilesets to build Areas 2 and 3 perform essentially the same operations, there is an 
important difference between them: the placement of the decision tile that begins the next stage of 
construction. From stage four of construction onward, we pass a flag north from a decision tile, to 
the first popup tile, and then east along the base column of Area 3. This flag transmission is shown 
with red arrows in the right side diagram of Figure 6, and in Figure 15. We implemented this in 
tiles by including "dec" in the glue names responsible for decision flag transmission. 
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Fig. 11. This illustrates the crawlback tile mechanism after the third stage of the sssST has been constructed, and the 
fourth decision tile has bound to the decision point. The red arrow shows the order of tile placement. The gluename 
suffixes "*on" and "*off" encode the bits that determine whether a crawlback tile should be colored ON or OFF 
when it binds. The fourth decision in this diagram is that Area 2 will be invisible (encoded 12). The coding sequence 
prefix (before decision 4) is (123), and is transmitted to (and through) the crawlback tiles as they are placed. The 
basecase construction appears in Figure 12, and we fill in the "Start Area 3" section with tiles that are a rotation 
and reflection of the basecase construction. 
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Fig. 12. Illustration of the operation of the basecase method for columns a; = 1 through x = 4. We encoded a flag 
that the crawlback tiles had reached the column x — 4 via "Cf" (for "first corner"). The coding sequence prefix in 
this example is (12); the order of tile placement (shown by the red arrows) would remain the same, regardless of 
coding sequence prefix. 



We recognize it is time to attach a new decision tile to the configuration, when an Area 3 
delimeter that started with the decision row popright tile connects with the top row of Area 3. 
(See Figure 14.) This means that the construction of Area 2 must differ from Area 3, because some 
tiles in the baserow of Area 2 need to be able to receive input from the south. Also, it is important 
for the proof of correctness of the algorithm that every tile in the construction bind with strength 
exactly 2. Therefore, we modify the way the baserow tiles of Area 2 bind; unlike Area 3, they are 
not an exact translation and rotation of the basecase tiles of Area 1. Instead, in columns of form 
X = 2^ + 4m for m > a natural number, the tile that binds at location (x, 2'^ + 1) will have glue 
strength zero on its southern edge. Every fourth tile in the baserow of Area 2 binds with strength 
1 from the west, and strength 1 from the south, so the location can receive information from the 
south. Otherwise, the Area 2 basecase method is identical to Area 3's. 

For reasons of space, we used a 32 x 32 diagram in the main body of the paper to show the 
relation between popup and popright tiles. However, it may be more useful to consider Figure 15, 
which shows the construction after 65 x 65 tiles have been placed, because it shows both popup 
columns and popright rows that have "periods" 4, 8, and 16 — where a column's or row's period is 
the number of tiles from one popup or popright tile to the next. Just like popup columns, popright 
rows toggle between POP and WAIT-TO-POP states, and the state changes when a popright 
delimeter encounters the row. 

The critical fact that allows this nested recursion to be programmed in tile assembly is that 
no location needs to transmit (or recognize) more than two delimeters: a popup delimeter, and a 
popright delimeter. (There are two types of popright delimeter, one of which is used for placement 
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of the next decision tile.) Therefore, the entire recursion can be computed using a fixed number of 
tile types, and the underlying geometry of the configuration being built. 
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Fig. 13. An example implementation of popup tile delimeters. This is the delimeter created while building Area 2 
during stage three of construction (i.e., the northeast 4x4 grid of the first 8x8 squares). Area 2 is masked (invisible) 
in this example, which is why glue names begin with "I" . A tile passes a delimeter northward by using a "D" in the 
glue name, and eastward with a "DR" (for "delimeter right" ) in the glue name. Use of "K" in the glue name indicates 
we have built the upper right corner of the Area. The "dec" in the east glue name of the popup tile transmits the 
decision flag eastward. ("Dec" does not appear in the column below the popup tile, because this is the stage in which 
the decision flag is first created, so production of the decision flag is hard coded to begin at that point.) 



D Proof of correctness of Algorithm 1 

Definition 4. Let TL'^ he the integers greater than zero, and let S £ {1,2,3}°° be enumerated as 
S = {si, S2, S3, . . .) . The statistically self-similar Sierpienski Triangle relative to coding sequence S 
is the set T\S CI Z"*" x Z"^ with the following recursive definition. 

1. (1,1) G T\S. 

2. For j, k,x,y eN such that 2^~^ + 1 < x <2^ and 2^"^ + 1 < y < 2^, and m = max(j, k) 



(x,y) eT\S ^ < 



Srn / 1 and {x,y-2^-^) & T\S 



j <k 



Srn7^2 and{x-2^''\y-2''~^)eT\S j 



^ Sm 7^ 3 and {x — 2^ ,y) Ci T\S 



3 > k. 
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Fig. 14. This is a closeup of Area 3 from Figure 15, in order to highlight operation of the special delimeter (shown 
with a green arrow) , by means of which we know to add a new decision tile. The delimeter originating at the popright 
tile of the previous decision row is the one that cues the next decision row that construction of the current stage 
of Area 3 has finished. Note the technical point that the delimeter must travel east through the popright tile, and 
north through the popup tile, because of the order of tile placement: crawlback tiles are placed after popright tiles, 
not before. 
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Fig. 15. A schematic of popup tiles and popright tiles after the first six stages of construction. This shows a 65 x 65 
square, and includes the seventh decision tile (i.e., decision G), and the associated popup tile and crawlback tiles. 
The coding sequence prefix transmitted north appears at the top. An identical prefix (enumerated from south to 
north) is also transmitted to the eastern border of the configuration. 
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We now restate Definition 1 with respect to Definition 4. 

Definition 5. Let T be a tile assembly system. We say T self-assembles the statistically self-similar 
Sierpinski Triangle if the following conditions hold: 

Liveness condition: For every infinite ternary coding sequence S G {1,2,3}°°, there is an assem- 
bly sequence IxofT whose terminal assembly is T\S. 

Safety condition: If A is a terminal assembly ofT, then there is a sequence S € {1,2,3}°° such 
that A = T\S. 

Fairness condition: For any finite coding sequence prefix s G {1,2, 3}" of length n, the probability 
that a terminal assembly ofT will be a configuration of some T\S such that S extends s, is 3"". 

We are interested in being able to prove that a certain tile assembly system always achieves a certain 
output. In [14], Soloveichik and Winfree presented a strong technique for this: local determinism. 
An assembly sequence "a is locally deterministic if (1) each tile added in "a binds with the minimum 
strength required for binding; (2) if there is a tile of type to at location I in the result of a, and t^ 
and the immediate "OUT-neighbors" of to are deleted from the result of "a , then no other tile type 
in T can legally bind at Z; the result of "a is terminal. Local determinism is important because of 
the following result. 

Theorem 2 (Soloveichik and Winfree [14]). IfTis locally deterministic, then T has a unique 
terminal assembly. 

The objective of this section of the Appendix is to prove the following theorem. 

Theorem 3. Let T be a tile assembly system that executes Algorithm 1. Then T self-assembles 
the statistically self-similar Sierpinski Triangle. 

Proof. Suppose, for contradiction, that the Safety condition is violated, and there exists a tile 
assembly sequence "a with terminal assembly ^ of T such that A ^ T\S iov any S C {1,2,3}°°. 
Let At be the tile configuration obtained when the first tile in "a is placed such that no terminal 
assembly legally obtainable from At is of form T\S for any S G {1,2,3}°°. Let t be the final tile 
placed in At. Let k be the last stage built correctly before placement of t. From Lemma 2, and the 
characteristics of the popup and popright tile mechanisms discussed in Section C, we know that the 
information transfer graph before placement of t transmits a frontier of form {Sk-idkSk-idk+i) , 
where Sk^i is the coding sequence prefix of stage k — 1. 

We know from Lemma 6 that Algorithm 1 correctly colors a location, based on coding sequence 
trits received from its neighbors. The frontier transmitted from stage k is designed precisely so the 
tiles can execute a local algorithm that mimics the recursive definition of the sssST in Definition 4. 
So the only way t could be laid in a way that violates construction of some T\S in Line 8 of 
Algorithm 1, would be if there was already an error at some earlier stage. Since t is the first such 
error, that is not possible. We know from Lemmas 3, 4 and 5 that t cannot be placed in error at 
other points in the algorithm. Because each tile binds to the configuration with strength exactly 2, 
we are guaranteed a unique terminal assembly, if we fix which decision tiles bind at decision points, 
because of local determinism. So the Safety condition cannot be violated. 

For the Liveness condition to be violated, there would have to be some S such that T\S could 
never be built. Then there would be some minimal finite ternary string a such that cr C S". Since 
the Safety condition holds, this could only happen if one of the three decision tile types could not 
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bind at the |f7|-th decision point of the construction, which is contrary to the assumption that each 
tile type binds witli 1/3 probability at any location where a given tile type might bind. 

Since the Safety and Liveness conditions hold, and each decision point accepts the bond of a 
tile that encodes "1," "2" or "3" with equal probability, the Fairness condition holds as well. 

As a final note, a key direction for future research is the design of programming tools that make it 
possible to implement algorithms for nested recursion in macroscale and nanoscale self-assembly. 
We designed several tile assembly systems using the tile assembly simulators of Lathrop and Patitz. 
At first, this was to gain an intuition for programming self-assembling networks. Later, it was to 
verify the accuracy of technical aspects of the proofs we presented in this paper. We have not 
implemented the full construction of the sssST in tiles, and anticipate that our implementation 
would require a tile assembly system with approximately 5,000 tile types. The current state of the 
art in tile assembly programming (except for a handful of specific tasks) is to use a GUI to design 
each tile type individually [8]. We believe this implementation problem raises two critical — and 
wide-open — research areas: (1) lower bound proofs for the number of tile types required to execute 
recursions, and to store and transmit information; and (2) a shape compiler that can automate 
creation of tile types when given graphical input from a user. 
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